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ABSTRACT The optional gronp I bstroa of the mitochon- 
drial 21S rRNA gene of Sscckmroawyces ceirnsaar coatafaa a 
235-codoo4oog open reading franc the tramlaboa product of 
wbJcb (the omugm traospocaat) catalyzes the formation of a 
table-strand break whals the iatroa-imlntu (omega ~) copies 
of the same gene. Purified omega trantpoaw geoeratcs ia rifro 
a 4- base-pair staggered art with 3' hydroxyi overhangs at the 
exact posttfcm where the mtroo rvenniaH} inserts la the gem. 
Using random); mntagcoiicd synthetic ottgooodeocides, sin- 
gle-bast mutants were prod need at 21 poamons aronnd the 
cleavage site. Experiments with these otigt»ocieocldes show 
thai the recognition she extends over an 1 Sybase pair-long 
sequence whhia which minimal ■ rqncacf d egenera c y b toler- 
ated. The mtron-encoded omega transpoaase ts, therefore, one 
of the most specific restriction endooocieases known to date. 



The group I intron of the mitochondrial 21S rRNA gene of 
Saccharomyces cerevisiae (rl intron), which is optional 
among different laboratory strains, shows the imcscsting 
property of propagating itself during crosses between intron- 
plus strains (omega*) and intron-minus ones (omega') (1- 
4). The phenomenon, which is formally equivalent to a gene 
conversion, causes rapid spreading of this particular intron 
within yeast populations undergoing random mating. Inser- 
tion of the intron is associated with a coconversion of flanking 
exons extending over a few hundred base pairs (bp) on each 
side (5). Analysis of mtDNA in young rygotes of omega * * 
omega' crosses has revealed the transient formation of a 
double-strand break occurring at or near the recipient site of 
the intron-minus gene {omega ~ site) (6. 7). Mutants that have 
an altered omega ' site do not show the double-strand break 
in similar crosses. 

The rl intron contains a 235-codon-long open reading 
frame (rl ORF) that is conserved in a variety of yeast species 
belonging to the Saccharomyces and Kltryveromyces genera 
and shares distinctive features with ORFs of other group I 
introns (8. 9). Mutants within the rl ORF easily demonstrate 
that its translation product must exist and is required for the 
gene conversion of the intron and its flanking exons (5, 10). 
Thus, the overall organization of the rl intron is reminiscent 
of a transposable element encoding its own transposasc. 

To characterize the omega transposasc, which, like the 
other intron-encoded proteins, remains undetectable in nor- 
mal mitochondria, we have previously engineered the rl ORF 
by modifying its nonuniversaj codons (11). The resulting 
universal code equivalent directs, from expression plasm ids, 
the synthesis of a protein with all characteristic properties 
expected of the genuine mitochondrial omega transposase. 
This protein exhibits a specific endonuclease activity, gen- 
erating in Escherichia coli a double -strand break at omega ~ 
sites carried on plasmids. 
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The omega transposase has already been partially purified 
from £. coli cells, and the fractions containing the partially 
purified substance can be shown to generate a double-strand 
cut within omega ~ sites ui vitro (12). Further purification to 
near homogeneity has now been achieved in our laboratory 
by following the in vitro activity of the protein. 

We have now characterized the recognition and cleavage 
site of the introtvencoded omega transposase in detail using 
in vitro assays. The protein generates a 4-bp staggered cut at 
a specific sequence, leaving free 3' hydroxy I overhangs that 
can be religaled. The recognition sequence extends over a 
continuous stretch of at least 18 bp almost exactly centered 
around the cleavage site. 

MATERIALS AND METHODS 

Strains and Plasmids. £. coli strain TGI is Mlac-pro) t sup 
E44, thi l.hsdDS. rUraD36, lac 1°, lac Z AM/5). £. coli 
growth media were made as described (13). Plasmid DNA 
preparations were made according to a reported procedure 
(14). 

Construction of Raadomly Mstafeaized omega- Sites. Ap- 
proximately 60 pmol of complementary oligonucleotides 
(Fig. 1) were hybridized and ligated to reconstitute a 59-bp- 
long fragment containing the complete omega " site (Fig. 2a). 
Oligonucleotides with free 5' termini internal to the duplex 
were phosphorylaled before hybridization to allow ligation, 
whereas oligonucleotides with 5' termini external to the 
duplex were not phosphorylaled to prevent multiple inser- 
tions. Each synthetic omega' she was then ligated between 
the Pa I and Xba I sites of the M13 tgl31 vector (Fig. lb). The 
ligation mixture was then added with 200 *d of TCM buffer 
(0.01 M Tris-HCl/0.01 M CaClj/0.01 M MgCI,) and used to 
transfect CaCl r treated £. coli cells (strain TGI) diluted and 
plated on a lawn of TGI cells to form plaques. 

Analysis of Recombinant Ml3 Clones with Randomly Mo- 
tageoized omega' Shea. Individual plaques were picked up at 
random and used to inoculate 2.5-ml cultures of TGI cells in 
dYT medium to prepare both double- and single -stranded 
DNA as described by Messing (17). For each clone, single- 
stranded DNA was sequenced by the dideoxynucleotide 
chain-termination method (18), whereas the double-stranded 
DNA was tested for cleavage by a purified omega- 
transposase (purification will be described elsewhere; A. 
Perrin. unpublished work). 

RESULTS 

The omega Transposase Generates a 4-bp Staggered Cut with 
3' Hydroxy! Overhangs at the utree^Iiawruoo Site. In yeast 
mitochondria as well as in transformed £. coli cells producing 
the omega transposasc, the double-strand cut has been shown 
to occur at or near the exact mtrorwrtsertion point within the 
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013 V TT& ft T RACMM(W^TT|tiCTpiSJUTfJlCa^T M TfiTaflCO«ftflGnCTflflCCT^ 
Olia^'ltCGTMCTMlKTTIITTTTCjr 

0»*« 3 *»»C6TMCTMTKTT(ITTT 5 ' 014c V ITCCTTTCICBTTCCMflTC 5 ' 

0t-»or62o 3 "TCMTiaaT tt cTiiT|TcctaTTaTa *' 

621 3'TC*TGCl*TCC£liIlilCCC*Tm« V 
013 3'TCI»TGCWTCCCTI>TTGTCCCail«TR 5 * 
D16 3' TC lWT6C|IHCCCT(ITT6TCCC«mT* 9" 

Fie. 1. Oliioouckocida used for the random mutagenesis of the omega- site. Oligonucleotides were synthesized using cyanoethyl 
diisopropyl phosphonnudite chemitiry oo automated DN A lymbeaztn (Applied Biosystcms, Foster City , CA, and Bioscarcb. Sen Rafael. CA) 
(15) tad were purified by get electrophoresis (16). For each obgcaucteoofe. inidexliiied bam 

potitioot. Proportion] of each of the three ones other than the wfld-type base tmiiratwi were, respectively, 2% for D13, 4.8% for G21, 11% 
for D13. and 15% for D16. In the case of D14B and D14b, wOd-typc positions A, G, and C were mutagrniTrd with 4%. 2%, and 1% of each 
of the three other bases, respectively. These proportioos have been calculated to obtain an average of one base substitution per motecuk. 



omega' site (6, 7, 11). Using purified fractions of the omega 
ttinsposase, we can now determine the precise cleavage site 
and the nature of the termini generated (sec Fig* 3). A stngk 
major radioactive band is obtained in each case, the length of 
which (75 bp and 62 bp for 5' end-labeling at 5mBI and Sd I 
sites, respectively, a«j6J bp aid 72 bp f« 3' enc^ 
I and SnaBl sites, respectively) reveals the exact location of the 
cut on each strand and indicates the formation of 3' hydroxy! 
overhangs of 4 bp. Fig, 3 also shows that the intron inserts 
exactly in the middle of the staggered cut. Comparison of the 
exact mobilhies of fragments generated by the omega transpo- 
sase with those of Maxam-Gilben sequencing reactions of the 
same DNA piece further shows thai a free hydroxy! group exists 
at the 3' end of each 4-bp extension and that each recessed 3' 
terminus must possess a phosphoryl group. In addition, results 



reveal that the cut sites measured from 3' end-labeling match 
those measured from 5' encWabeiing— showing that the omega 
transposasc gener ates a single cut on each strand without any 
other detectable activity. That the two canpiemeotary sticky 
ends can be reUgated with each other using T4 DN A ligase(daxa 
not shown) further confirms the single-cut hypothesis. 

The Recognita Site of the easel* Tra^aposaat fcaeada Over 
18 bp. To define the sequence recognized by the protein and 
to determine its specificity, we generated a scries of single- 
base substitutions in a 30-bp fragment centered around the 
cleavage site as described in Figs. 1 and 2. 

Of 243 independent clones sequenced from six different 
hybridizing experiments, we found 97 single mutants (40%), 
63 double mutants (27%), and 22 triple or quadruple mutants 
(9%). The remaining clones were wild type. These values 
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Fic. 2. Omstrocuoo of randomly nuitafenized omega' sites Synthetic 59-bp fragments reconstituting the omef~ site and I nankins 
sequences were constructed by hybridizing oligonucleotides containing randomly muuf»mi^^ p^uocs (black sectors, see F* 1) as 
indicated [a). The 7234-bp Pu \-Xba 1 fragment from M13 1|131 vector (Amersham) was srl purified and Ugated wrth the artificial omega sites 
ib). After traiufeciioa of £. coti TGI. each recombinant clone was used to prepare both doabtt strand and tingle-strand DMA. 
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Fig. 3. Naturt of the cleavage site. For mapping of the 3' termini: DNA from pUsmid pSCM 522 (a pUC 2 denvative thai <^™»JJ* 
omega' site) w U restricted either at iu unique 5a/ I she \a) or ai tu unique JimBI site ib), dcphosphoryUted and 5 ™ 
polynucleotide kinase and ( r "P]ATP. After secondary cleavage, the single end-labeled Set WmBI fragment was gel purified and then 
submitted to the action of the omegc transpose. For trapping of the 3' termini. DNA from pUsmid pSCM 522 ^ Imewued aj I either tu uruque 
Pu | site (c) or at iu unique J/wBI site (<0 and 3' end-labeled with terminal deo«yniKkoodyHrans/era»e and |«-»P)ddATP. ^ h | e ~™*^ 
DNA was then submitted to the omega digestion. After incubaboo. the fragments were ekctrophoresed on an J* f^^^ 1 *?"—^!* 
sequencing gel calibrated using Maxam-Gilbert reactions of the same DNA fragments 5' end-labeled other « the* 5a/ I site <« •»« « <*J« 
5na Bl site (6 and <T). Dou and numben indicate size of labeled fragments in nucleotides. The sequence is P»nofLr»e 21S rRNA gene and shows 
the introo insertion point ( o) and the cleavage of each strand < ♦ >. The upper strand is identical to the 215 rRNA. 



approximate the expected random distribution for an average, 
of one substitution per oligonucleotide molecule. However, 
strong biases in distribution of the base substitutions were 
seen— some bases being mutated much more frequently than 
others. Pan of this bias is probably from the trmnsfection itself 
because results from independent iransfection experiments 
from the same hybridizing mixture differed (data not shown). 
Finally, we isolated a total of 32 different single -base substi- 
tutions in the segment studied, some mutations being found 
several times among the 97 single mutants obtained. These 
selected mutants were plaque purified and sequenced again. 
Fig. 4 shows the sequence of all representative mutants. 
Mutations are available in 20 positions of a 25-bp segment. 
The omega" j mutation previously isolated in vivo in yeast 
mitochondria (19) provides a 21st position. 

Double-strand DN As were purified from all single mutants 
as well as from some double mutants and were exposed to 
omega transposase action. Wild-type clones served as con- 
trols. Under our experimental conditions, digestion of the 
wild-type omega' site is only partial (-50%) because the 
active fraction contains only a limited amount of omega 
transposase relative to the total amount of DNA used in the 
assays (A. Perrin. unpublished work). Cleavage efficiency at 
the mutant site of each clone was estimated by ethidium 
bromide fluorescence of fragments and referred to the effi- 
ciency at the wild-type site under similar conditions (Fig. 5). 
Three kinds of mutants can be recognized: those in which 
cleavage is undistinguishable from the wild type ( + ), those 
that are not cleaved at all (0), and those that are cleavable but 
with lower efficiency than the wild type it). This last category 
contains a range of intermediate cleavages, some mutants 
(e.g., 357, 653. 711, 369. and 810) being only slightly affected, 
whereas others are severely affected. Comparisons of reac- 
tions performed with independent mutants having identical 
nucleotide sequences as well as independent reactions done 



on the same mutant show the reproducibility of the experi- 
ment and the validity of this classification. We also tested the 
significance of the previous classification using three enzyme 
concentrations (1 x , 3 x , and 6x ) resulting, respectively, in 
cleavage of 50%. 80%, and 99% of the wild-type site. Under 
such conditions, a mutant belonging to the 0 class (318) 
remained completely uncleaved. whereas two mutants be- 
longing to the e class (357 and 840) showed significantly 
reduced cleavage (5%, 25%, and 50% for. respectively, the 
1 x , 3 x , and 6 x enzyme concentrations). 

The three kinds of mutants are indicated in Fig. 4, which 
shows the following: <i) Mutants affecting cleavage occur 
from position -7 (mutant 155) to position + 11 (mutant 147), 
indicating that the recognition site of the omega transposase 
extends at least over 18 bp. The absence of mutants at 
position - 8 and after position + 11 does not permit any more 
precise limits on the recognition site; (u) recognition of the 
omega ~ site by omega transposase involves a continuous 
stretch within this 18-bp segment because at least one mutant 
exists at every position (except + 9 and + 10) of the 0 or * 
type: tiii) within this 18-bp segment very few mutations (826 
and 516) do not affect cleavage. 

DISCUSSION 

Our results demonstrate that an tntron-encoded protein 
catalvzes the formation of a site-specific douWe-strand break 
within a gene, leaving 4-bp sticky ends with 3' hydroxyl 
overhangs like those produced by class II restriction endo- 
nucleases (20). The position of the break coincides exactly 
with the intron-inscrtion site, mdkating it as the initiating 
event of the whole mtron-insenion process. Thus the orga- 
nization of this intron is reminiscent of a transposaWe 
element encoding its own transposase (21)— except that the 
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Fic. 4. Mutational analysis of ihc recognition sit*. Sequence of the 21S rRNA get* "round the omega i>r**t» ™ J?*"™" 
indicates the iniron msenion point in the middle of Ok staged cut (thick line). The three poaaibk bp substitution, (teft boiet) at «verypo*uoo 
are shown: the black botes correspond to wild-type sequence. Number* refer to mutants and refer, to mutant. «» '"J^lLi- 

Efficiency of cleavage is as defined in F.g. 3: + . mutant cleaved as wild-type by the omega irmnsposa^-. 0. no * ^ X^!"*! 

«. reduced level of cleavage by the omega transposase. For the two S mutants, the cleavage eft^eycy (0) has been 4*™**™ 
experiments in yeast 10'). Mutants 121 and 520 have the following alterations in addition to the one indicated: 121. a 1-bp dekuon at position 
- U; 520. a A — T transversion at position - 15- 



laner intron always insens in the same recipient site and does 
not duplicate bases at that site. 

The described phenomenon is analogous in several ways 
with the yeast matins type interconversion catalyzed by the 
HO endonuclcase (22) and can be interpreted by the double- 
strand break -repair model (23, 24). However, for the omega 
system, degradation of preexisting sequences around the 
cleavage site is not required. It follows thai one free 3' 
extension generated by omega transposase can be a primer to 
elongate a copy of the iniron sequence as shown in Fig. 6. 
This model differs from the classic double-strand break- 
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Fic. 5. Assays for cleavage by the omega transposase. Assays 
were done in standard restriction bufTer. 0.02 M Tris-HCl/6 mM 
MfCU/0 0J M NaCI/1 mM DL-dithiothreiiol. pH 7.5, containing 
bovine jerum albumin at 100 Mg/ml. The double-stranded DNA was 
first digested by da I for 1 hr and then treated with 2-5 of purified 
fractions of omega transposase for another hour at 3TC. Sizes of the 
Cta I restriction fragments of the M13 recombinants arc indicated in 
bp on the left of the gel: fragment sizes generated after cleavage by 
the omega transposase are indicated on the right. The latter corre- 
spond to the cleavage, at the omega' site, of the 4370-bp-tong 
fragment. Cleavage efficiency was estimated from the ratio of the 
3760-bp fragment to the * 370- bp fragment. Lanes: a and c. mutants 
of the 0 type; b. mutant of the * type; and d, mutant of the + type. 



repair model by the facts that (/) this model requires special 
ends, (if) recombi national intermediates include the omega 
transposase, and (Hi) a single-strand intermediate exists in 
this model. 

Our results also demonstrate that the recognition site of the 
omega transposase extends over at least 18 bp. Most base 
substitutions within these 18 bp either completely abolish 
cleavage or significantly reduce cleavage efficiency, indicat- 
ing that the omega transposase tolerates minimal sequence 
degeneracy. The 29 single-base mutants isolated from the 54 
possible ones (54%) span 17 positions of the 18 bp forming the 
recognition site. Because no selection was applied in the 
protocol, it is reasonable to assume that type distribution of 
the remaining 25 possible mutants would resemble that 
already found. Thus, the omega endonuckase is much more 
specific than any other bacterial restriction cndonucleasc 
known so far (26). The recognition site of the HO endonu- 
clcase also extends over an 18-bp-long segment, but the 
degree of degeneracy tolerated is less precisely known due to 
fewer available mutants (27). Interestingly, the only other 
eukaryotic double-strand endonucleases known to date, all 
from yeast (HO. See 1, and See II). generate 4-bp extensions 
with 3' overhangs (22. 28). Yet, the sequences cleaved by 
these proteins do not obviously resemble one another or the 
omega' site. Thus, the omega transposase may eventually 
become a useful restriction enzyme for specific experiments 
such as genomic mapping and cloning. 

Mutations within the recognition site that reduce or abolish 
cleavage can be interpreted in either of two ways: alteration 
of the binding of the protein to the site or binding without 
cleavage. Preliminary experiments using one of the omega 9 
mutants favor the second possibility (A. Perrin, unpublished 
work). The large size of the recognition site does not mean 
that the omega transposase. which is a relatively small 
protein (235 amino acids), actually contacts ail 18 bp of the 
recognition site. Mutations at some positions may alter the 
conformation of the helix at nearby positions involved in the 
contacts. Precise protein-DNA interactions need to be stud- 
ied. The A repressor (29), the X integrate <3<J), or the resolvase 
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Fig. 6. Hypothetical mechanism of imron insertion based on the 
nature of the cut. The figure stresses that the 4-bp extension 
(5'ATAAqm ) can serve directly as primer (o > for elongation of the lop 
strand by DNA polymerase after a strand-assimilation process 
similar to that mediated by RecA protein in E. coii (15) ib\ broken 
line). The top strand is identicaJ to the 21S rRNA. When the newly 
synthesized strand reaches the downstream eson it can. in turn, be 
assimilated by the corresponding sequence from the spiii omega ' 
duplex ib) and serves as matrix for the eiongaiion of the bottom 
strand (c). This elongation, however, requires the caonucleoJytic 
degradation of. at least, the *-bp extension (5TTATom) on the 
bottom strand. Resolution of the intermediate could be either by 
branch migration (as shown), leaving an intact omega * duplex, or by 
cleavage of the two Hoi liday junctions, forming ■ hybrid between the 
old and the new strand. Lowercase letters and uppercase letters 
represent sequences of the rl intron and of the exons. respectively. 
Shadowed section represents a hypothetical complex thai hoWs the 
split omega' duplex together with the omega* duplex; the omega 
transposase could be contained in such a complex. 

family (31) probably offer better analogies for the omega 
transposase-DNA interactions than bacterial type II restric- 
tion endonucleases. The omega transposase. however, dif- 
fers from these models for at least three reasons: (0 the 
omega' site is nonsymmetrical; 00 secondary binding sites 
arc unlikely h/i vitro cleavage occurs even if the omega ' site 
is transported into a completely new environment); and (iii) 
no recombinogenic activity of the omega transposase could 
be found in experiments involving replicons with two 
omega' sites (12). 

The extreme sequence specificity shown by the intron- 
encoded omega transposase is puzzling with respect to the 
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small size of the mitochondrial genome and what advantage, 
if any. the yeast gains by synthesizing such a specific 
endonuclease. If the role of the intronic protein were to 
propagate the intron by introducing it at new genetic loca- 
tions, then too high a specificity could only limit efficiency of 
the process. On the other hand, synthesizing double-strand 
endonucleases with a relatively low specificity is obviously 
deleterious for a cell without the corresponding modification 
mechanism. Bacteria have evolved sophisticated restriction- 
modification systems, whereas eukaryotcs have not. Perhaps 
we should not be surprised that the few eukaryooc endonu- 
cleases characterized to date are so specific. 
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